In the Claims:

Please amend the claims as follows. 

1. (Currently Amended) [[A scalable method for implementing
FFT/IFFT computations in multiprocessor architectures that provides improved
throughput by eliminating the need for inter-processor communication after the
computation of the first "log2P" stages of the FFT/IFFT computations for a
multiprocessor architecture including an implementation using "P" processing
elements]] A method for controlling processing elements in a multiprocessor architecture
to provide improved throughput for FFT/IFFT computations, the method comprising the
steps of:

[[Computing]] computing, on a multiprocessor architecture including "P"
processing elements, each butterfly of the first "log2P" stages of an FFT/IFFT on either
a single one of the processing elements or on each of the "P" processing elements
simultaneously; and

distributing the computations of the butterflies in all the subsequent
stages of the FFT/IFFT among the "P" processing elements [[processors]] such that each
chain of cascaded butterflies consisting of those butterflies that have inputs and
outputs connected together are processed by the same [[processor]] processing element
to thereby eliminate the need for inter-processor communication among the
processing elements after the computation of the first "log2P" stages of the FFT/IFFT.
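
By way of illustration only, and not as a limitation of the claim, the partitioning recited above can be sketched as follows. The sketch assumes a radix-2 decimation-in-frequency FFT, a power-of-two number of samples, and a sequential loop standing in for the "P" processing elements; the names `dif_stage` and `fft_partitioned` are hypothetical and do not appear in the application.

```python
import cmath

def dif_stage(x, start, size):
    """Apply one radix-2 decimation-in-frequency stage to x[start:start+size] in place."""
    half = size // 2
    for k in range(half):
        w = cmath.exp(-2j * cmath.pi * k / size)  # twiddle factor for this butterfly
        a, b = x[start + k], x[start + k + half]
        x[start + k] = a + b
        x[start + k + half] = (a - b) * w

def fft_partitioned(samples, p):
    """FFT of `samples`; the first log2(p) stages are shared, the rest run per block."""
    x = list(samples)
    n = len(x)
    size = n
    # First log2(p) stages: butterflies reach across block boundaries.
    while size > n // p:
        for start in range(0, n, size):
            dif_stage(x, start, size)
        size //= 2
    # Subsequent stages: block q only reads and writes x[q*block:(q+1)*block],
    # so each block could be given to one processing element (simulated here
    # by an ordinary loop) with no data exchanged between blocks.
    block = n // p
    for q in range(p):
        s = block
        while s >= 2:
            for start in range(q * block, (q + 1) * block, s):
                dif_stage(x, start, s)
            s //= 2
    # Decimation in frequency leaves the result in bit-reversed order.
    bits = n.bit_length() - 1
    out = [0j] * n
    for i in range(n):
        out[int(format(i, f"0{bits}b")[::-1], 2)] = x[i]
    return out
```

In this sketch every chain of connected butterflies after the first log2(p) stages stays inside one contiguous block, which is the property that removes the need for communication between blocks.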

2. (Currently Amended) A method as claimed in claim 1 wherein the
distributing of the computation of the butterflies subsequent to the first "log2P"
butterflies is achieved by assigning operand addresses of each set of butterfly
operands to each [[processor]] processing element in such a manner that the butterfly is
processed by the same [[processor]] processing element that computed the connected
butterfly of the previous stage in the same chain of butterflies.

3. (Original) A method as claimed in claim 2 wherein the desired
assignment of operand addresses is achieved by deriving the address of the first
operand in the operand pair corresponding to the "i-th" stage of the computation from
the address of the corresponding operand in the previous stage by inserting a "0" in
the "(i+1)-th" bit position of the address, while the address of the second operand is
derived by inserting a "1" in the bit position of the operand address.
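
By way of illustration only, the bit-insertion step recited above can be sketched as follows. The helper names `insert_bit` and `operand_pair` are hypothetical, and the sketch assumes that bit positions are counted from the least significant bit starting at 0; the claim itself does not fix a numbering convention.

```python
def insert_bit(value, pos, bit):
    """Return `value` with `bit` inserted at bit position `pos` (0 = least significant)."""
    low = value & ((1 << pos) - 1)   # bits below the insertion point stay in place
    high = value >> pos              # bits at or above it move up by one position
    return (high << (pos + 1)) | (bit << pos) | low

def operand_pair(address, pos):
    """Derive the two operand addresses by inserting a 0 and a 1 at position `pos`."""
    return insert_bit(address, pos, 0), insert_bit(address, pos, 1)
```

Under these assumptions the two derived addresses differ only in the inserted bit, so for an insertion at position 1 the four input values 0 to 3 yield the pairs (0, 2), (1, 3), (4, 6) and (5, 7).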

4. (Currently Amended) A method as claimed in claim 1 further
including the computing of twiddle factors for the butterfly computations at each
[[processor]] processing element by initializing a counter and then incrementing it by a
value corresponding to the number of [[processors]] processing elements "P" and
appending the result with a specified number of "0"s.

5. (Currently Amended) [[A system for obtaining scalable
implementation of FFT/IFFT computations in multiprocessor architectures that
provides improved throughput by eliminating the need for inter-processor
communication after the computation of the first "log2P" stages of the FFT/IFFT
computations for a multiprocessor architecture including an implementation using "P"
processing elements]] A system for controlling processing elements in a multiprocessor
architecture including P processing elements, comprising:

a means for computing each butterfly of the first "log2P" stages of
an FFT/IFFT on either a single processing element [[processor]] or on each of the "P"
processing elements [[processors]] simultaneously;

an addressing means for distributing the computation of the butterflies in
all the subsequent stages of the FFT/IFFT among the "P" [[processors]] processing
elements such that each chain of cascaded butterflies consisting of those butterflies
that have inputs and outputs connected together, are processed by the same
[[processor]] processing element to thereby eliminate the need for inter-processor
communication among the processing elements after the computation of the first
"log2P" stages of the FFT/IFFT;

means for counting; and 

means for computing twiddle factors for the butterfly computations at
each [[processor]] processing element, the means for computing initializing the means
for counting and then incrementing the means for counting by a value corresponding
to the number of [[processors]] processing elements "P" and appending the result with a
specified number of "0"s.

6. (Currently Amended) A system as claimed in claim 5 wherein the
addressing means comprises address generation means for deriving the operand
addresses of the butterflies subsequent to the first "log2P" butterflies in such a manner
that the butterfly is processed by the same [[processor]] processing element that
computed the connected butterfly of the previous stage in the same chain of
butterflies.

7. (Original) A system as claimed in claim 6 wherein the address 
generation means is a computing mechanism for deriving the address of the first 
operand in the operand pair corresponding to the "i-th" stage of the computation from
the address of the corresponding operand in the previous stage by inserting a "0" in
the "(i+1)-th" bit position of the address, and deriving the address of the second operand
by inserting a "1" in the "(i+1)-th" bit position of the operand address.

8. (Currently Amended) A system as claimed in claim 5 further 
including a computing mechanism for address generation of twiddle factors for each 
butterfly on the corresponding [[processor]] processing element.

9. (Currently Amended) A method of performing a fast Fourier
transform or inverse fast Fourier transform on an input signal [[a plurality of inputs to
generate a plurality of outputs, the method being performed on a plurality of
processors and each transform including a plurality of stages containing at least one
butterfly computational block]], the method comprising:

storing samples of the input signal in a memory;

retrieving the samples from the memory and from these retrieved 
samples calculating the butterfly computational blocks for the first "log2P" stages
of the transform on a single [[one of the]] processor or on a plurality of [[the]] processors
operating in parallel; and 

eliminating the need for communication among and between the 
processors after the computation of the first "log2P" stages of the transform by
calculating chains of butterfly computational blocks corresponding to the subsequent 
stages of the transform within each of the processors, each chain of butterfly 
computational blocks that is calculated in a respective processor having inputs and 
outputs coupled in series.

10. (Currently Amended) The method of claim 9 wherein the first 
[[log2(P)]] log2P stages of the transform are calculated on all of the processors operating
in parallel. 

11. (Original) The method of claim 9 wherein the method is 
performed on two processors, and wherein the first two stages of a radix-2 fast Fourier 
transform or inverse fast Fourier transform are calculated as a single radix-4 stage, 
and wherein the subsequent stages of the transform are computed as radix-2 stages. 

12. (Original) The method of claim 11 wherein chains comprises a 
single loop that iterates N/2 * (log2(N/2))/(number of processors) times.

13. (Original) The method of claim 12 wherein each butterfly
computational block includes a plurality of operands each having an associated 
address, and wherein calculating chains of butterfly computational blocks 
corresponding to the subsequent stages comprises assigning addresses to each of the 
operands so that each butterfly block in a chain is calculated in the same processor. 

14. (Original) The method of claim 13 wherein each butterfly 
computational block includes a pair of operands, and wherein the operand addresses 
of these operands are assigned by deriving the address of the first operand in the 
operand pair corresponding to the "i-th" stage of the calculation in the chain from the
address of the corresponding operand in the previous stage by inserting a "0" in the
"(i+1)-th" bit position of the operand address, and deriving the operand address of the
second operand by inserting a "1" in the "(i+1)-th" bit position of the operand address.

15. (Original) The method of claim 9 further comprising
initializing a counter and then incrementing the counter by a value corresponding to 
the number of processors and appending the result with a specified number of "0"s to 
compute the twiddle factors for each butterfly computational block. 

16. (Currently Amended) A processor system, comprising:
a memory operable to store samples of an input signal;

a plurality of processors coupled to the memory, the plurality of
processors operable to receive the samples from the memory and to use the samples
to [[execute a fast Fourier transform or inverse fast Fourier transform algorithm on a
plurality of inputs to generate a plurality of outputs, each transform including a plurality
of stages containing at least one butterfly computational block, and the processors
operable to]] execute the butterfly computational blocks for the first "log2P"
stages of [[the]] a fast Fourier transform or inverse fast Fourier transform on either a
single one of the processors or on a plurality of the processors operating in parallel;
and

address circuitry coupled to the memory and processor and operable to 
distribute the computation of the butterfly computational blocks in all stages 
subsequent to the first log2P [[states]] stages among the plurality of processors
such that each chain of cascaded butterfly computational blocks in the transform are 
coupled in series and are computed by the same processor to thereby eliminate the 
need for communication among and between the processors after the computation of 
the first "log2P" stages of the transform.

17. (Original) The processor system of claim 16 wherein the address 
circuitry is further operable to derive operand addresses for each of the butterfly blocks 
subsequent to the first "log2P" butterfly blocks so that each of the butterfly 
computational blocks is computed by the same processor that computed a butterfly 
computational block of the previous stage in the same chain of butterfly computational 
blocks. 

18. (Original) The processor system of claim 17 wherein each 
butterfly computational block includes a pair of operands, and wherein the address 
circuitry assigns operand addresses of these operands by deriving the address of the 
first operand in the operand pair corresponding to the "i-th" stage of the calculation in
the chain from the address of the corresponding operand in the previous stage by
inserting a "0" in the "(i+1)-th" bit position of the operand address, and deriving the
operand address of the second operand by inserting a "1" in the "(i+1)-th" bit position of
the operand address. 

19. (Original) The processor system of claim 17 wherein the 
processors further comprise a counter that is initialized and then incremented by a 
value corresponding to the number of processors, an output of the counter being
appended with a specified number of "0"s to compute twiddle factors for each butterfly 
computational block. 

20. (Original) The processor system of claim 16 wherein each of the 
processors comprises a digital signal processor. 

21. (Currently Amended) An electronic system, comprising:
a processor system, including,

a memory operable to store samples of an input signal;
a plurality of processors coupled to the memory, the plurality of
processors operable to receive samples from the memory and to use the samples to
[[execute a fast Fourier transform or inverse fast Fourier transform algorithm on a
plurality of inputs to generate a plurality of outputs, each transform including a plurality
of stages containing at least one butterfly computational block, and the processors
operable to]] execute the butterfly computational blocks for the first "log2P"
stages of [[the]] a fast Fourier or inverse fast Fourier transform on either a single one of
the processors or on a plurality of the processors operating in parallel; and

address circuitry coupled to the memory and processor and 
operable to distribute the computation of the butterfly computational blocks in all 
stages subsequent to the first log2P [[states]] stages among the plurality of
processors such that each chain of cascaded butterfly computational blocks in the 
transform are coupled in series and are computed by the same processor to thereby 
eliminate the need for communication among and between the processors after the 
computation of the first "log2P" stages of the transform.

22. (Currently Amended) The electronic system of claim [[22]] 21
wherein the electronic system comprises a communications system. 

23. (Currently Amended) The electronic system of claim [[20]] 21
wherein each of the processors comprises a digital signal processor. 